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DETAILED ACTION 
Response to Arguments 

1 . Applicant's arguments witli respect to claims 1-30 have been considered but are 
moot in view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

3. Claims 13, 14, 27, and 28 rejected under 35 U.S.C. 102(b) as being anticipated 
by Shriberg et al. "Can Prosody Aid the Automatic Classification of Dialog Acts in 
Conversational Speech?" (hereinafter Shriberg). 

Re claim 13, Shriberg teaches a method of synthesizing speech using discourse 
function level prosodic features comprising the steps of: 

determining output information (Pages 4-5, Why Use Prosody?); 

determining discourse functions in the output information based on a contextually 
aware theory of discourse analysis using a mapping between basic discourse 
constituents of the contextually aware theory of discourse analysis and a plurality of 
discourse functions (Pages 8-13); 

determining a model of discourse function level prosodic features (Pages 14-18); 
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determining adjusted syntliesized speecli output (Pages 4-5, Why Use 
Prosody?) based on the discourse functions and the model of discourse function level 
prosodic features (Pages 14-18). 

Re claims 14 and 28, Shriberg teaches the method of claim 13, In which the 
context is at least one of: semantic, pragmatic, and syntactic context (pages 4-5) 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the phor art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. Claims 1-12, 15-26, and 29-30 rejected under 35 U.S.C. 103(a) as being 
unpatentable over Shriberg et al. "Can Prosody Aid the Automatic Classification of 
Dialog Acts in Conversational Speech?" (hereinafter Shriberg) in view of Chino US 
5761 637 A (hereinafter Chino). 

Re claims 1,15, 29, and 30, Shriberg teaches a method of synthesizing speech 
(Page 5) using discourse function level prosodic features (Pages 14-18) comprising the 
steps of: 

determining output information (Pages 4-5, Why Use Prosody?); 
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determining discourse functions in the output information the discourse functions 
being determined based on a mapping between basic discourse constituents of the 
determined theory of discourse analysis and a plurality of discourse functions (Pages 8- 
13); 

determining a model of discourse function level prosodic features (Pages 14-18); 

determining adjusted synthesized speech output (Page 5) based on the 
discourse functions and the model of discourse function level prosodic features (Pages 
14-18) 

However, determining a theory of discourse analysis from a plurality of theories 
(Chino Col. 1 lines 41-49) of discourse analysis based on the speech to be synthesized 
(Chino Col. 6 lines 13-60 Fig. 7); 

Chino teaches a discourse structure extraction section 7 receives the utterance 
function corresponding to each clue from the utterance function extraction section 5 and 
generates the discourse structure representing the flow of dialogue by referring to a 
structure generation rule memory section 8. The structure generation rule memory 
section 8 previously memorizes structure generation rule. Then, a discourse structure 
memory section 9 stores the discourse structure. (This processing will be explained 
afterward). Chino teaches various discourse structures generated based on the 
utterance function extraction unit. Additionally, Chino teaches the use of Discourse 
Structure Theory or Situation Semantics as part of natural language processing. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention determining a theory of discourse analysis from several theories of 
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discourse analysis for syntliesizing speecli using discourse function level prosodic 
features, where discourse functions are mapped to discourse functions. Using a 
discourse structure or analysis allows for diversification of various types of utterances, 
where conversational speech of different topics can be distinguished from one another 
and learned by the speech processing system. 

Re claims 2 and 16, Shriberg teaches the method of claim 1, wherein the 
discourse functions are determined based on the determined theory of discourse 
analysis (Pages 8-13). 

Re claims 3 and 17, Shriberg fails to teach the method of claim 2, in which the 
theory of discourse analysis (Chino Col. 1 lines 41-49) is at least one of: the Linguistic 
Discourse Model, the Unified Linguistic Discourse Model, Rhetorical Structures Theory, 
Discourse Structure Theory and Structured Discourse Representation Theory (Chino 
Col. 6 lines 13-60 Fig. 7); 

Chino teaches a discourse structure extraction section 7 receives the utterance 
function corresponding to each clue from the utterance function extraction section 5 and 
generates the discourse structure representing the flow of dialogue by referring to a 
structure generation rule memory section 8. The structure generation rule memory 
section 8 previously memorizes structure generation rule. Then, a discourse structure 
memory section 9 stores the discourse structure. (This processing will be explained 
afterward). Chino teaches various discourse structures generated based on the 



Application/Control Number: 10/785,199 Page 6 

Art Unit: 2626 

utterance function extraction unit. Additionally, Chino teaches the use of Discourse 
Structure Theory or Situation Semantics as part of natural language processing. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention determining a theory of discourse analysis from several theories of 
discourse analysis for synthesizing speech using discourse function level prosodic 
features, where discourse functions are mapped to discourse functions. Using a 
discourse structure or analysis allows for diversification of various types of utterances, 
where conversational speech of different topics can be distinguished from one another 
and learned by the speech processing system. 

Re claims 4 and 18, Shriberg teaches the method of claim 1 , wherein the output 
information (Pages 4-5, Why Use Prosody?) is at least one of text information and 
application output information (Pages 8-13). 

Re claims 5 and 19, Shriberg teaches the method of claim 1 , wherein 
determining the adjusted synthesized speech output (Pages 4-5, Why Use Prosody?) 
further comprises the steps of: 

determining a synthesized speech output (Pages 4-5, Why Use Prosody?) based 
on the output information (Pages 8-13); 

determining discourse function level prosodic feature adjustments (Pages 14-18); 
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determining adjusted synthesized speech output based on the synthesized 
speech output (Pages 8-13) and the discourse level prosodic feature adjustments 
(Pages 14-18). 

Re claims 6 and 20, Shriberg teaches the method system of claim 1 , wherein the 
model of discourse function level prosodic features (Pages 14-18) is a predictive model 
of discourse functions (Page 19). 

Re claims 7 and 21 , Shriberg teaches the method of claim 6, in which the 
predictive models are determined based on at least one of: machine learning and rules 
(Page 19). 

Re claims 8 and 22, Shriberg teaches the method of claim 1 , in which the 
prosodic features occur in at least one of a location: preceding, within and following the 
associated discourse function (Page 14). 

Re claims 9 and 23, Shriberg teaches the method of claim 1 , in which the 
prosodic features are encoded within a prosodic feature vector. 

Re claims 10 and 24, Shriberg teaches the method of claim 9, in which the 
prosodic feature vector is a multimodal feature vector (Pages 14-18 & Table 10). 
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Re claims 1 1 and 25, Shriberg teaches the method of claim 1 , in which the 
discourse function is an intra-sentential discourse function (Page 8 & Table 1). 

Re claims 12 and 26, Shriberg teaches the method of claim 1 , in which the 
discourse function is ml inter-sentential discourse function (Page 8 & Table 1). 

Conclusion 

6. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. US 6810378 B2, US 5751906 A, US 58901 17 A, US 
20020078091 A1 , US 20020142277 A1 , US 20040044519 A 1 , US 20050042592 A1 , 
US 20070073533 A1 , US 5732395 A, and Jurafsky et al., "Automatic Detection of 
Discourse Structure for Speech Recognition Understanding", Automatic Speech 
Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop.. 

Conclusion 

7. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 

§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
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mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael C. Colucci whose telephone number is (571)- 
270-1847. The examiner can normally be reached on 9:30 am - 6:00 pm, Monday- 
Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571 )-272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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